k-Shot Learning for Action Recognition
نویسندگان
چکیده
In the problem of k-shot learning, a model must learn to reliably classify an example having seen only k previous instances of examples of the same class. With recent success in using memory in neural networks to perform kshot learning, we propose a technique that uses MemoryAugmented Neural Networks to perform k-shot learning for action recognition in videos. We believe the use of memory will help learn encoded generalizable representations of actions that can be used for classification. We use the Memory-Augmented Neural Network framework to evaluate k-shot learning for the Kinetics Dataset of action clips taken from YouTube videos. Our approach works well for this domain, achieving results close to 78% accuracy in the 9-shot learning case.
منابع مشابه
Multi-Label Zero-Shot Human Action Recognition via Joint Latent Embedding
Human action recognition refers to automatic recognizing human actions from a video clip, which is one of the most challenging tasks in computer vision. Due to the fact that annotating video data is laborious and timeconsuming, most of the existing works in human action recognition are limited to a number of small scale benchmark datasets where there are a small number of video clips associated...
متن کاملGeneralized Zero-Shot Learning for Action Recognition with Web-Scale Video Data
Action recognition in surveillance video makes our life safer by detecting the criminal events or predicting violent emergencies. However, efficient action recognition is not free of difficulty. First, there are so many action classes in daily life that we cannot pre-define all possible action classes beforehand. Moreover, it is very hard to collect real-word videos for certain particular actio...
متن کاملOne Shot Similarity Metric Learning for Action Recognition
The One-Shot-Similarity (OSS) is a framework for classifierbased similarity functions. It is based on the use of background samples and was shown to excel in tasks ranging from face recognition to document analysis. However, we found that its performance depends on the ability to effectively learn the underlying classifiers, which in turn depends on the underlying metric. In this work we presen...
متن کاملAlternative Semantic Representations for Zero-Shot Human Action Recognition
A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information are accessible on Web with little cost...
متن کاملKeep It Simple and Sparse: Real-Time Action Recognition
Sparsity has been showed to be one of the most important properties for visual recognition purposes. In this paper we show that sparse representation plays a fundamental role in achieving one-shot learning and real-time recognition of actions. We start off from RGBD images, combine motion and appearance cues and extract state-of-the-art features in a computationally efficient way. The proposed ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017